Pitfalls and Important Issues in Testing Reliability Using Intraclass Correlation Coefficients in Orthopaedic Research
نویسندگان
چکیده
BACKGROUND Intra-class correlation coefficients (ICCs) provide a statistical means of testing the reliability. However, their interpretation is not well documented in the orthopedic field. The purpose of this study was to investigate the use of ICCs in the orthopedic literature and to demonstrate pitfalls regarding their use. METHODS First, orthopedic articles that used ICCs were retrieved from the Pubmed database, and journal demography, ICC models and concurrent statistics used were evaluated. Second, reliability test was performed on three common physical examinations in cerebral palsy, namely, the Thomas test, the Staheli test, and popliteal angle measurement. Thirty patients were assessed by three orthopedic surgeons to explore the statistical methods testing reliability. Third, the factors affecting the ICC values were examined by simulating the data sets based on the physical examination data where the ranges, slopes, and interobserver variability were modified. RESULTS Of the 92 orthopedic articles identified, 58 articles (63%) did not clarify the ICC model used, and only 5 articles (5%) described all models, types, and measures. In reliability testing, although the popliteal angle showed a larger mean absolute difference than the Thomas test and the Staheli test, the ICC of popliteal angle was higher, which was believed to be contrary to the context of measurement. In addition, the ICC values were affected by the model, type, and measures used. In simulated data sets, the ICC showed higher values when the range of data sets were larger, the slopes of the data sets were parallel, and the interobserver variability was smaller. CONCLUSIONS Care should be taken when interpreting the absolute ICC values, i.e., a higher ICC does not necessarily mean less variability because the ICC values can also be affected by various factors. The authors recommend that researchers clarify ICC models used and ICC values are interpreted in the context of measurement.
منابع مشابه
Reliability of 2 ultrasonic imaging analysis methods in quantifying lumbar multifidus thickness.
STUDY DESIGN Reliability study. OBJECTIVES To compare the within- and between-day intrarater reliability of rehabilitative ultrasound imaging (RUSI) using static images (static RUSI) and video clips (video RUSI) to quantify multifidus muscle thickness at rest and while contracted. Secondary objectives were to compare the measurement precision of averaging multiple measures and to estimate rel...
متن کاملPhysical performance assessment in military service members.
Few established measures allow effective quantification of physical performance in severely injured service members. We sought to establish preliminary normative data in 180 healthy, active-duty service members for physical performance measures that can be readily implemented in a clinical setting. Interrater and test-retest reliability and minimal detectable change (MDC) values were also deter...
متن کاملCross-cultural adaptation and validation of the Japanese Knee Injury and Osteoarthritis Outcome Score (KOOS).
BACKGROUND In Japan, only few cross-culturally adapted, internationally used orthopaedic patient self-assessed outcome scores are available. In addition, the high incidence of knee osteoarthritis (OA) suggests the need for validated outcome measures such as the widely used Knee Injury and Osteoarthritis Outcome Score (KOOS) for Japanese populations. The purpose of this study was to provide a cr...
متن کاملبررسی روایی و پایایی نسخه فارسی پرسشنامه «کیفیت زندگی افراد ناتوان سازمان بهداشت جهانی» (WHOQOL-DIS) در افراد سالمند ناتوان
Objective: The main purpose of the present study was to evaluate psychometric properties of Persian version of WHOQOL-DIS questionnaire in elderly people with disability. Materials & Methods A classical psychometric method was used to evaluate validity and reliability of WHOQOL-DIS questionnaire in elderly people with disability. Lawshe, and Waltz and Bausell methods were used for assessing ...
متن کاملData Driven Approaches to Testing Homogeneity of Intraclass Correlation Coefficients
The test of homogeneity for intraclass correlation coefficients has been one of the active topics in statistical research. Several chi-square tests have been proposed to test the homogeneity of intraclass correlations in the past few decades. The big concern for them is that these methods are seriously biased when sample sizes are not large. In this thesis, data driven approaches are proposed t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 4 شماره
صفحات -
تاریخ انتشار 2012